Performance Criteria for Graph Clustering and Markov Cluster

نویسندگان

  • S. van Dongen
  • Stijn van Dongen
چکیده

In 6] a cluster algorithm for graphs was introduced called the Markov cluster algorithm or MCL algorithm. The algorithm is based on simulation of (stochastic) ow in graphs by means of alternation of two operators, expansion and innation. The results in 8] establish an intrinsic relationship between the corresponding algebraic process (MCL process) and cluster structure in the iterands and the limits of the process. Several kinds of experiments conducted with the MCL algorithm are described here. Test cases with varying homogeneity characteristics are used to establish some of the particular strengths and weaknesses of the algorithm. In general the algorithm performs well, except for graphs which are very homogeneous (such as weakly connected grids) and for which the natural cluster diameter (i.e. the diameter of a subgraph induced by a natural cluster) is large. This can be understood in terms of the ow characteristics of the MCL algorithm and the heuristic on which the algorithm is grounded. A generic performance criterion for clusterings of weighted graphs is derived, by a stepwise reenement of a simple and appealing criterion for simple graphs. The most reened criterion uses a particular Schur convex function, several properties of which are established. A metric is deened on the space of partitions, which is useful for comparing diierent clusterings of the same graph. The metric is compared with the metric known as the equivalence mismatch coeecient. The performance criterion and the metric are used for the quantitative measurement of experiments conducted with the MCL algorithm on randomly generated test graphs with 10000 nodes. Scaling the MCL algorithm requires a regime of pruning the stochastic matrices which need to be computed. The eeect of pruning on the quality of the retrieved clusterings is also investigated. Note: Mathematical aspects of the MCL process are described in 8]. The work was carried out under project INS{3.2, Concept Building from Key{Phrases in Scientiic Documents and Bottom Up Classiication Methods in Mathematics. 1. Introduction The Markov cluster algorithm or MCL algorithm was introduced in 6]. Its deenition is repeated here and a summary of results from 6] and 8] is given. The report corresponds with Chapters 9, 10, and 12 in the PhD thesis 7]. The MCL algorithm is basically a shell around an algebraic process deened for stochastic matrices, called the MCL process. The process consists of alternation of two operators, called expansion and innation. The image of a …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Cluster-Based Image Segmentation Using Fuzzy Markov Random Field

Image segmentation is an important task in image processing and computer vision which attract many researchers attention. There are a couple of information sets pixels in an image: statistical and structural information which refer to the feature value of pixel data and local correlation of pixel data, respectively. Markov random field (MRF) is a tool for modeling statistical and structural inf...

متن کامل

Graph Clustering by Hierarchical Singular Value Decomposition with Selectable Range for Number of Clusters Members

Graphs have so many applications in real world problems. When we deal with huge volume of data, analyzing data is difficult or sometimes impossible. In big data problems, clustering data is a useful tool for data analysis. Singular value decomposition(SVD) is one of the best algorithms for clustering graph but we do not have any choice to select the number of clusters and the number of members ...

متن کامل

Performance Testing of RNSC and MCL Algorithms on Random Geometric Graphs

The exploration of quality clusters in complex networks is an important issue in many disciplines, which still remains a challenging task. Many graph clustering algorithms came into the field in the recent past but they were not giving satisfactory performance on the basis of robustness, optimality, etc. So, it is most difficult task to decide which one is giving more beneficial clustering resu...

متن کامل

Ricochet: A Family of Unconstrained Algorithms for Graph Clustering

Partitional graph clustering algorithms like K-means and Star necessitate a priori decisions on the number of clusters and threshold on the weight of edges to be considered, respectively. These decisions are difficult to make and their impact on clustering performance is significant. We propose a family of algorithms for weighted graph clustering that neither requires a predefined number of clu...

متن کامل

Comparative Performance Analysis of Rnsc and Mcl Algorithms on Power-law Distribution

Cluster analysis of graph related problems is an important issue now-a-day. Different types of graph clustering techniques are appeared in the field but most of them are vulnerable in terms of effectiveness and fragmentation of output in case of real-world applications in diverse systems. In this paper, we will provide a comparative behavioural analysis of RNSC (Restricted Neighbourhood Search ...

متن کامل

Interference-Aware and Cluster Based Multicast Routing in Multi-Radio Multi-Channel Wireless Mesh Networks

Multicast routing is one of the most important services in Multi Radio Multi Channel (MRMC) Wireless Mesh Networks (WMN). Multicast routing performance in WMNs could be improved by choosing the best routes and the routes that have minimum interference to reach multicast receivers. In this paper we want to address the multicast routing problem for a given channel assignment in WMNs. The channels...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000